SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Sato, Takaaki

(ii) TITLE OF INVENTION: TREX, A NOVEL GENE OF TRAF-INTERACTING EXT GENE FAMILY AND DIAGNOSTIC AND THERAPEUTIC USES THEREOF

(iii) NUMBER OF SEQUENCES: 37

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Cooper & Dunham LLP
(B) STREET: 1185 Avenue of the Americas
(C) CITY: New York
(D) STATE: New York
(E) COUNTRY: U.S.A.
(F) ZIP: 10036

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: White, John P.
(B) REGISTRATION NUMBER: 28,678
(C) REFERENCE/DOCKET NUMBER: 0575/51902-A-PCT

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (212) 278-0400
(B) TELEFAX: (212) 391-0525



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3479 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 458..3211

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



CCTGATCGTT GGTAGTGGCA TGGAGGACGG GGCTGGCATT TCAGACTGCC AGCTGTTTTT 60

ACCAGCCGCT GCATCACTTG AATAGAAGCT ATGCATATTG GCTGGCCGAC AAAGCCAAGG 120

GACAAAAGCT ATGGCCGTTA AAATGGTCCC TCTGAGTCCA GGGCTCTTTC CCTGGCTTTT 180

AGCACCATGG ATCTCTTCCT TTTCATCCCA TCAGCAATGT GGTACCTTCT TCTACTTGAT 240

GATGACAGCT AAGATGAAAT GATACTTCAG ATTTGCCTGA CTAAGGTTAG AAACCTGAAT 300

CGCTGTGAGG TTCCATTTTA CTTGGTGCCT TGTGCAGGGA GCACACTGAT CCTTCCAGAA 360

ACTTGTGTGT GAAAAGAGGT TGCGTTTTGT CAGACAGACT CATGGTTATG GCGAGCGATC 420

CGACGTGATC AGAGTGGGCA AGAGGCACAG CGAACTC ATG ACA GGC TAT ACC ATG 475
Met Thr Gly Tyr Thr Met
1 5

TTG CGG AAT GGG GGA GTG GGG AAC GGT GGT CAG ACC TGT ATG CTG CGC 523
Leu Arg Asn Gly Gly Val Gly Asn Gly Gly Gln Thr Cys Met Leu Arg
10 15 20

TGG TCC AAT CGC ATC CGG CTG ACA TGG CTG AGT TTC ACG CTG TTC ATC 571
Trp Ser Asn Arg Ile Arg Leu Thr Trp Leu Ser Phe Thr Leu Phe Ile
25 30 35

ATC CTC GTC TTC TTC CCC CTC ATT GCT CAC TAT TAC CTC ACC ACT CTG 619
Ile Leu Val Phe Phe Pro Leu Ile Ala His Tyr Tyr Leu Thr Thr Leu
40 45 50

GAC GAG GCA GAC GAG GCT GGC AAG CGC ATC TTC GGC CCT CGG GCT GGC 667
Asp Glu Ala Asp Glu Ala Gly Lys Arg Ile Phe Gly Pro Arg Ala Gly
55 60 65 70

AGT GAG CTC TGT GAG GTA AAG CAT GTC CTT GAT CTC TGT CGG ATT CGT 715
Ser Glu Leu Cys Glu Val Lys His Val Leu Asp Leu Cys Arg Ile Arg
75 80 85

GAG TCT GTG AGC GAA GAG CTT CTA CAG CTC GAA GCC AAG CGG CAG GAG 763
Glu Ser Val Ser Glu Glu Leu Leu Gln Leu Glu Ala Lys Arg Gln Glu
90 95 100



CTG AAC AGC GAG ATT GCC AAG CTG AAC CTC AAG ATT GAA GCC TGT AAG 811
Leu Asn Ser Glu Ile Ala Lys Leu Asn Leu Lys Ile Glu Ala Cys Lys
105 110 115

AAG AGC ATA GAG AAT GCC AAG CAG GAC CTG CTG CAG CTC AAG AAT GTC 859
Lys Ser Ile Glu Asn Ala Lys Gln Asp Leu Leu Gln Leu Lys Asn Val
120 125 130

ATT AGC CAG ACA GAG CAC TCC TAC AAG GAG CTG ATG GCC CAG AAC CAG 907
Ile Ser Gln Thr Glu His Ser Tyr Lys Glu Leu Met Ala Gln Asn Gln
135 140 145 150

CCC AAA CTG TCC CTG CCC ATC CGA CTG CTC CCT GAG AAG GAC GAT GCC 955
Pro Lys Leu Ser Leu Pro Ile Arg Leu Leu Pro Glu Lys Asp Asp Ala
155 160 165

GGC CTT CCA CCC CCC AAG GTC ACT CGG GGT TGC CGC CTT CAC AAC TGC 1003
Gly Leu Pro Pro Pro Lys Val Thr Arg Gly Cys Arg Leu His Asn Cys
170 175 180



TTT GAT TAC TCT CGT TGT CCT CTG ACG TCT GGC TTT CCC GTC TAC GTC 1051
Phe Asp Tyr Ser Arg Cys Pro Leu Thr Ser Gly Phe Pro Val Tyr Val
185 190 195

TAT GAC AGT GAC CAG TTT GCC TTT GGG AGC TAC CTG GAC CCT TTG GTC 1099
Tyr Asp Ser Asp Gln Phe Ala Phe Gly Ser Tyr Leu Asp Pro Leu Val
200 205 210

AAG CAG GCT TTT CAG GCT ACA GTG AGA GCC AAC GTT TAT GTT ACA GAA 1147
Lys Gln Ala Phe Gln Ala Thr Val Arg Ala Asn Val Tyr Val Thr Glu
215 220 225 230

AAT GCG GCC ATC GCC TGC CTG TAT GTG GTG TTA GTG GGA GAA ATG CAA 1195
Asn Ala Ala Ile Ala Cys Leu Tyr Val Val Leu Val Gly Glu Met Gln
235 240 245

GAG CCC ACT GTG CTG CGG CCT GCC GAC CTT GAA AAG CAG CTG TTT TCT 1243
Glu Pro Thr Val Leu Arg Pro Ala Asp Leu Glu Lys Gln Leu Phe Ser
250 255 260

CTG CCA CAC TGG AGG ACA GAT GGG CAC AAC CAC GTC ATT ATC AAC CTG 1291
Leu Pro His Trp Arg Thr Asp Gly His Asn His Val Ile Ile Asn Leu
265 270 275

TCC CGG AAG TCA GAC ACA CAG AAT CTA CTG TAC AAC GTC AGT ACA GGC 1339
Ser Arg Lys Ser Asp Thr Gln Asn Leu Leu Tyr Asn Val Ser Thr Gly
280 285 290

CGC CAT GTG GCC CAG TCC ACC CTC TAT GCT GCC CAG TAC AGA GCT GGC 1387
Arg His Val Ala Gln Ser Thr Leu Tyr Ala Ala Gln Tyr Arg Ala Gly
295 300 305 310

TTT GAC CTG GTC GTG TCA CCC CTT GTC CAT GCT ATG TCT GAA CCC AAC 1435
Phe Asp Leu Val Val Ser Pro Leu Val His Ala Met Ser Glu Pro Asn
315 320 325

TTC ATG GAA ATC CCA CCG CAG GTG CCA GTT AAG CGG AAA TAT CTC TTC 1483
Phe Met Glu Ile Pro Pro Gln Val Pro Val Lys Arg Lys Tyr Leu Phe
330 335 340

ACT TTC CAG GGC GAG AAG ATC GAG TCT CTG AGA TCT AGC CTT CAG GAG 1531
Thr Phe Gln Gly Glu Lys Ile Glu Ser Leu Arg Ser Ser Leu Gln Glu
345 350 355

GCC CGT TCC TTC GAG GAA GAG ATG GAG GGC GAC CCT CCG GCC GAC TAT 1579
Ala Arg Ser Phe Glu Glu Glu Met Glu Gly Asp Pro Pro Ala Asp Tyr
360 365 370

GAC GAT CGC ATC ATT GCC ACC CTA AAG GCT GTA CAG GAC AGC AAG CTG 1627
Asp Asp Arg Ile Ile Ala Thr Leu Lys Ala Val Gln Asp Ser Lys Leu
375 380 385 390

GAT CAG GTG CTG GTA GAA TTC ACT TGC AAA AAC CAG CCG AAG CCT AGC 1675
Asp Gln Val Leu Val Glu Phe Thr Cys Lys Asn Gln Pro Lys Pro Ser
395 400 405

CTG CCG ACT GAG TGG GCA CTG TGT GGG GAG CGG GAA GAC CGC CTG GAG 1723
Leu Pro Thr Glu Trp Ala Leu Cys Gly Glu Arg Glu Asp Arg Leu Glu
410 415 420

TTA CTG AAG CTC TCC ACC TTC GCC CTC ATC ATC ACT CCC GGG GAC CCG 1771
Leu Leu Lys Leu Ser Thr Phe Ala Leu Ile Ile Thr Pro Gly Asp Pro
425 430 435



CGC CTG CTC ATT TCA TCT GGG TGT GCC ACG CGG CTC TTC GAG GCC CTG 1819
Arg Leu Leu Ile Ser Ser Gly Cys Ala Thr Arg Leu Phe Glu Ala Leu
440 445 450

GAG GTG GGG GCC GTG CCG GTG GTG CTC GGG GAG CAG GTG CAG CTC CCG 1867
Glu Val Gly Ala Val Pro Val Val Leu Gly Glu Gln Val Gln Leu Pro
455 460 465 470

TAC CAC GAC ATG CTG CAG TGG AAC GAG GCC GCC CTG GTG GTG CCC AAG 1915
Tyr His Asp Met Leu Gln Trp Asn Glu Ala Ala Leu Val Val Pro Lys
475 480 485

CCT CGC GTC ACA GAG GTC CAC TTC CTG TTA CGA AGT CTT TCA GAC AGT 1963
Pro Arg Val Thr Glu Val His Phe Leu Leu Arg Ser Leu Ser Asp Ser
490 495 500

GAT CTG TTG GCC ATG AGG CGG CAA GGC CGC TTT CTC TGG GAG ACC TAC 2011
Asp Leu Leu Ala Met Arg Arg Gln Gly Arg Phe Leu Trp Glu Thr Tyr
505 510 515

TTC TCC ACC GCA GAC AGT ATT TTT AAT ACC GTG CTG GCC ATG ATT AGG 2059
Phe Ser Thr Ala Asp Ser Ile Phe Asn Thr Val Leu Ala Met Ile Arg
520 525 530

ACT CGA ATT CAG ATC CCA GCT GCT CCC ATC CGG GAA GAG GTA GCG GCT 2107
Thr Arg Ile Gln Ile Pro Ala Ala Pro Ile Arg Glu Glu Val Ala Ala
535 540 545 550

GAG ATC CCC CAT CGT TCA GGC AAA GCA GCT GGA ACT GAC CCC AAC ATG 2155
Glu Ile Pro His Arg Ser Gly Lys Ala Ala Gly Thr Asp Pro Asn Met
555 560 565

GCT GAC AAT GGG GAC CTG GAC CTG GGG CCG GTA GAG ACA GAA CCA CCC 2203
Ala Asp Asn Gly Asp Leu Asp Leu Gly Pro Val Glu Thr Glu Pro Pro
570 575 580

TAT GCC TCA CCT AAA TAC CTC CGC AAT TTC ACT CTG ACT GTC ACA GAC 2251
Tyr Ala Ser Pro Lys Tyr Leu Arg Asn Phe Thr Leu Thr Val Thr Asp
585 590 595

TGT TAC CGT GGC TGG AAC TCT GCC CCG GGA CGG TTC CAT CTT TTT CCC 2299
Cys Tyr Arg Gly Trp Asn Ser Ala Pro Gly Arg Phe His Leu Phe Pro
600 605 610

CAC ACA CCC TTT GAT CCT GTG TTG CCC TCT GAG GCC AAA TTC TTG GGC 2347
His Thr Pro Phe Asp Pro Val Leu Pro Ser Glu Ala Lys Phe Leu Gly
615 620 625 630

TCA GGG ACT GGA TTT CGG CCG ATC GGT GGC GGG GCT GGG GGC TCT GGC 2395
Ser Gly Thr Gly Phe Arg Pro Ile Gly Gly Gly Ala Gly Gly Ser Gly
635 640 645

AAG GAG TTC CAG GCA GCG CTC GGA GGC AAT GTC CAG CGG GAG CAG TTC 2443
Lys Glu Phe Gln Ala Ala Leu Gly Gly Asn Val Gln Arg Glu Gln Phe
650 655 660

ACA GTT GTG ATG CTG ACC TAC GAG CGG GAG GAA GTG CTC ATG AAC TCC 2491
Thr Val Val Met Leu Thr Tyr Glu Arg Glu Glu Val Leu Met Asn Ser
665 670 675

CTG GAG AGA CTC AAC GGC CTC CCC TAC CTG AAC AAG GTA GTG GTG GTG 2539
Leu Glu Arg Leu Asn Gly Leu Pro Tyr Leu Asn Lys Val Val Val Val
680 685 690



TGG AAC TCT CCC AAG CTG CCC TCG GAG GAC CTT TTG TGG CCA GAC ATT 2587
Trp Asn Ser Pro Lys Leu Pro Ser Glu Asp Leu Leu Trp Pro Asp Ile
695 700 705 710

GGT GTC CCC ATC ATG GTC GTC CGT ACT GAG AAG AAC AGT TTG AAC AAT 2635
Gly Val Pro Ile Met Val Val Arg Thr Glu Lys Asn Ser Leu Asn Asn
715 720 725

CGG TTC TTG CCC TGG AAT GAG ATT GAG ACA GAG GCC ATA CTG TCC ATC 2683
Arg Phe Leu Pro Trp Asn Glu Ile Glu Thr Glu Ala Ile Leu Ser Ile
730 735 740

GAC GAT GAT GCT CAC CTC CGC CAT GAT GAA ATC ATG TTT GGG TTT TGG 2731
Asp Asp Asp Ala His Leu Arg His Asp Glu Ile Met Phe Gly Phe Trp
745 750 755

GTG TGG AGA GAA GCA CGT GAT CGC ATT GTG GGT TTC CCT GGC CGG TAC 2779
Val Trp Arg Glu Ala Arg Asp Arg Ile Val Gly Phe Pro Gly Arg Tyr
760 765 770

CAT GCG TGG GAC ATC CCG CAC CAG TCC TGG CTC TAC AAT TCC AAC TAC 2827
His Ala Trp Asp Ile Pro His Gln Ser Trp Leu Tyr Asn Ser Asn Tyr
775 780 785 790

TCC TGT GAG CTG TCC ATG GTG CTG ACG GGC GCT GCC TTC TTT CAC AAG 2875
Ser Cys Glu Leu Ser Met Val Leu Thr Gly Ala Ala Phe Phe His Lys
795 800 805

TAT TAT GCC TAC CTG TAT TCT TAT GTG ATG CCC CAG GCC ATC CGG GAC 2923
Tyr Tyr Ala Tyr Leu Tyr Ser Tyr Val Met Pro Gln Ala Ile Arg Asp
810 815 820

ATG GTG GAC GAG TAC ATC AAC TGT GAG GAT ATC GCC ATG AAC TTC CTT 2971
Met Val Asp Glu Tyr Ile Asn Cys Glu Asp Ile Ala Met Asn Phe Leu
825 830 835

GTC TCC CAC ATC ACA CGG AAA CCC CCC ATC AAG GTG ACA TCA AGG TGG 3019
Val Ser His Ile Thr Arg Lys Pro Pro Ile Lys Val Thr Ser Arg Trp
840 845 850

ACT TTT CGA TGC CCA GGG TGC CCT CAG GCC CTG TCC CAT GAT GAC TCT 3067
Thr Phe Arg Cys Pro Gly Cys Pro Gln Ala Leu Ser His Asp Asp Ser
855 860 865 870

CAT TTT CAC GAG CGG CAC AAG TGT ATC AAC TTT TTT GTC AAG GTG TAC 3115
His Phe His Glu Arg His Lys Cys Ile Asn Phe Phe Val Lys Val Tyr
875 880 885

GGC TAT ATG CCT CTC TTG TAC ACA CAG TTC AGG GTG GAC TCC GTG CTC 3163
Gly Tyr Met Pro Leu Leu Tyr Thr Gln Phe Arg Val Asp Ser Val Leu
890 895 900

TTC AAG ACC CGC CTG CCC CAT GAC AAG ACC AAG TGC TTC AAG TTC ATC 3211
Phe Lys Thr Arg Leu Pro His Asp Lys Thr Lys Cys Phe Lys Phe Ile
905 910 915

TAGGGCCTTG CAGTTCTGAG GAGACAATGA GCAGAGCGAG GGGGAGTCAC CCTCAAGGTT 3271

CCCAAGGTGT CGAAGGTCCT TGGGGACATC TGTCGGGCAG GGCCAAGACC CTTTGCTGGG 3331

AGAGGCAGCA GGAAGAGTGG AAAGGGATAG CTGTCTTTCA TTTTGAAGTC AGCCACACTG 3391

GGCCTGGGAT CCTGGTCAGA GACTCAGGNC GTCTGCACAG GGCACTGACT GATAGCGAAC 3451

ACTGAGGACT GTTCATAAGC CCAGGACA 3479

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 918 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Thr Gly Tyr Thr Met Leu Arg Asn Gly Gly Val Gly Asn Gly Gly
1 5 10 15

Gln Thr Cys Met Leu Arg Trp Ser Asn Arg Ile Arg Leu Thr Trp Leu
20 25 30

Ser Phe Thr Leu Phe Ile Ile Leu Val Phe Phe Pro Leu Ile Ala His
35 40 45

Tyr Tyr Leu Thr Thr Leu Asp Glu Ala Asp Glu Ala Gly Lys Arg Ile
50 55 60

Phe Gly Pro Arg Ala Gly Ser Glu Leu Cys Glu Val Lys His Val Leu
65 70 75 80

Asp Leu Cys Arg Ile Arg Glu Ser Val Ser Glu Glu Leu Leu Gln Leu
85 90 95

Glu Ala Lys Arg Gln Glu Leu Asn Ser Glu Ile Ala Lys Leu Asn Leu
100 105 110

Lys Ile Glu Ala Cys Lys Lys Ser Ile Glu Asn Ala Lys Gln Asp Leu
115 120 125

Leu Gln Leu Lys Asn Val Ile Ser Gln Thr Glu His Ser Tyr Lys Glu
130 135 140

Leu Met Ala Gln Asn Gln Pro Lys Leu Ser Leu Pro Ile Arg Leu Leu
145 150 155 160

Pro Glu Lys Asp Asp Ala Gly Leu Pro Pro Pro Lys Val Thr Arg Gly
165 170 175

Cys Arg Leu His Asn Cys Phe Asp Tyr Ser Arg Cys Pro Leu Thr Ser
180 185 190

Gly Phe Pro Val Tyr Val Tyr Asp Ser Asp Gln Phe Ala Phe Gly Ser
195 200 205

Tyr Leu Asp Pro Leu Val Lys Gln Ala Phe Gln Ala Thr Val Arg Ala
210 215 220

Asn Val Tyr Val Thr Glu Asn Ala Ala Ile Ala Cys Leu Tyr Val Val
225 230 235 240

Leu Val Gly Glu Met Gln Glu Pro Thr Val Leu Arg Pro Ala Asp Leu
245 250 255

Glu Lys Gln Leu Phe Ser Leu Pro His Trp Arg Thr Asp Gly His Asn
260 265 270

His Val Ile Ile Asn Leu Ser Arg Lys Ser Asp Thr Gln Asn Leu Leu
275 280 285

Tyr Asn Val Ser Thr Gly Arg His Val Ala Gln Ser Thr Leu Tyr Ala
290 295 300

Ala Gln Tyr Arg Ala Gly Phe Asp Leu Val Val Ser Pro Leu Val His
305 310 315 320

Ala Met Ser Glu Pro Asn Phe Met Glu Ile Pro Pro Gln Val Pro Val
325 330 335

Lys Arg Lys Tyr Leu Phe Thr Phe Gln Gly Glu Lys Ile Glu Ser Leu
340 345 350

Arg Ser Ser Leu Gln Glu Ala Arg Ser Phe Glu Glu Glu Met Glu Gly
355 360 365

Asp Pro Pro Ala Asp Tyr Asp Asp Arg Ile Ile Ala Thr Leu Lys Ala
370 375 380

Val Gln Asp Ser Lys Leu Asp Gln Val Leu Val Glu Phe Thr Cys Lys
385 390 395 400

Asn Gln Pro Lys Pro Ser Leu Pro Thr Glu Trp Ala Leu Cys Gly Glu
405 410 415

Arg Glu Asp Arg Leu Glu Leu Leu Lys Leu Ser Thr Phe Ala Leu Ile
420 425 430

Ile Thr Pro Gly Asp Pro Arg Leu Leu Ile Ser Ser Gly Cys Ala Thr
435 440 445

Arg Leu Phe Glu Ala Leu Glu Val Gly Ala Val Pro Val Val Leu Gly
450 455 460

Glu Gln Val Gln Leu Pro Tyr His Asp Met Leu Gln Trp Asn Glu Ala
465 470 475 480

Ala Leu Val Val Pro Lys Pro Arg Val Thr Glu Val His Phe Leu Leu
485 490 495

Arg Ser Leu Ser Asp Ser Asp Leu Leu Ala Met Arg Arg Gln Gly Arg
500 505 510

Phe Leu Trp Glu Thr Tyr Phe Ser Thr Ala Asp Ser Ile Phe Asn Thr
515 520 525

Val Leu Ala Met Ile Arg Thr Arg Ile Gln Ile Pro Ala Ala Pro Ile
530 535 540

Arg Glu Glu Val Ala Ala Glu Ile Pro His Arg Ser Gly Lys Ala Ala
545 550 555 560

Gly Thr Asp Pro Asn Met Ala Asp Asn Gly Asp Leu Asp Leu Gly Pro
565 570 575

Val Glu Thr Glu Pro Pro Tyr Ala Ser Pro Lys Tyr Leu Arg Asn Phe
580 585 590

Thr Leu Thr Val Thr Asp Cys Tyr Arg Gly Trp Asn Ser Ala Pro Gly
595 600 605

Arg Phe His Leu Phe Pro His Thr Pro Phe Asp Pro Val Leu Pro Ser
610 615 620

Glu Ala Lys Phe Leu Gly Ser Gly Thr Gly Phe Arg Pro Ile Gly Gly
625 630 635 640

Gly Ala Gly Gly Ser Gly Lys Glu Phe Gln Ala Ala Leu Gly Gly Asn
645 650 655

Val Gln Arg Glu Gln Phe Thr Val Val Met Leu Thr Tyr Glu Arg Glu
660 665 670

Glu Val Leu Met Asn Ser Leu Glu Arg Leu Asn Gly Leu Pro Tyr Leu
675 680 685

Asn Lys Val Val Val Val Trp Asn Ser Pro Lys Leu Pro Ser Glu Asp
690 695 700

Leu Leu Trp Pro Asp Ile Gly Val Pro Ile Met Val Val Arg Thr Glu
705 710 715 720

Lys Asn Ser Leu Asn Asn Arg Phe Leu Pro Trp Asn Glu Ile Glu Thr
725 730 735

Glu Ala Ile Leu Ser Ile Asp Asp Asp Ala His Leu Arg His Asp Glu
740 745 750

Ile Met Phe Gly Phe Trp Val Trp Arg Glu Ala Arg Asp Arg Ile Val
755 760 765

Gly Phe Pro Gly Arg Tyr His Ala Trp Asp Ile Pro His Gln Ser Trp
770 775 780

Leu Tyr Asn Ser Asn Tyr Ser Cys Glu Leu Ser Met Val Leu Thr Gly
785 790 795 800

Ala Ala Phe Phe His Lys Tyr Tyr Ala Tyr Leu Tyr Ser Tyr Val Met
805 810 815

Pro Gln Ala Ile Arg Asp Met Val Asp Glu Tyr Ile Asn Cys Glu Asp
820 825 830

Ile Ala Met Asn Phe Leu Val Ser His Ile Thr Arg Lys Pro Pro Ile
835 840 845

Lys Val Thr Ser Arg Trp Thr Phe Arg Cys Pro Gly Cys Pro Gln Ala
850 855 860

Leu Ser His Asp Asp Ser His Phe His Glu Arg His Lys Cys Ile Asn
865 870 875 880

Phe Phe Val Lys Val Tyr Gly Tyr Met Pro Leu Leu Tyr Thr Gln Phe
885 890 895

Arg Val Asp Ser Val Leu Phe Lys Thr Arg Leu Pro His Asp Lys Thr
900 905 910

Lys Cys Phe Lys Phe Ile
915

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6172 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 594..3350

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:



GGCGGGTCCC TGAGCTGGAA GCCGGAGAGC AAGCCCTGGA GGTTCACTCT TTCAAGAAGT 60

CGTGTGCTGA CTGATTGTTG GGTGTAATGC TACACAAGTC AGAGGAAGGA AGGGTCCTGA 120

AACACATGGC GCAAAGGCAT CATAAGAAGC TGGCATTTAT TTCTGTTCTA ACCTATTACT 180

GTATAACTGT GAATAGACAC TATGCATATT TGTTGGTCAG CAAAACCAAG AAACAAGAGC 240

TATGGCATTT GAAAAAGTCT GTCTGATTCC AGGGTGTTTT TCCTGGGTTT CATCATCAGG 300

TACCTCCTCC CTTTCATCTC AGCAAGAATG TGGCACCTTT TATCGTTTGA TAAAGATTAA 360

GGACATGTTC TTTCAGCTGC TTTGGTCAAC AGCCAGAACT TAAAATCTGC TGGAATAGGG 420

TCAGAGACCA AGCTGAGGAA AATGAAATGT TCATTTTATT TGGTGCCTTG TCTGGGGAGC 480

ACACTAACTC TTCTGGAAAC GTGTCAGTGA AACAGAGATC GTTTTGTGGA ATAGCAACCC 540

ATGGTTATGG CGAGTGACCC GACGTGATCT GGGGGGGAGG CTGCAGAGGA CTC ATG 596
Met

ACA GGC TAT ACC ATG CTG CGG AAT GGG GGC GCG GGG AAC GGA GGT CAG 644
Thr Gly Tyr Thr Met Leu Arg Asn Gly Gly Ala Gly Asn Gly Gly Gln
920 925 930 935

ACC TGC ATG CTG CGC TGG TCC AAC CGC ATC CGC CTC ACG TGG CTC AGC 692
Thr Cys Met Leu Arg Trp Ser Asn Arg Ile Arg Leu Thr Trp Leu Ser
940 945 950

TTC ACG CTC TTT GTC ATC CTG GTC TTC TTC CCG CTC ATC GCC CAC TAT 740
Phe Thr Leu Phe Val Ile Leu Val Phe Phe Pro Leu Ile Ala His Tyr
955 960 965

TAC CTC ACC ACT CTG GAT GAG GCT GAT GAG GCA GGC AAG CGG ATT TTT 788
Tyr Leu Thr Thr Leu Asp Glu Ala Asp Glu Ala Gly Lys Arg Ile Phe
970 975 980

GGT CCC CGG GTG GGG AAC GAG CTG TGC GAG GTG AAG CAC GTG CTG GAT 836
Gly Pro Arg Val Gly Asn Glu Leu Cys Glu Val Lys His Val Leu Asp
985 990 995

CTG TGC CGC ATC CGG GAG TCG GTG AGT GAA GAG CTC CTG CAG CTG GAG 884
Leu Cys Arg Ile Arg Glu Ser Val Ser Glu Glu Leu Leu Gln Leu Glu
1000 1005 1010 1015

GCC AAG CGC CAA GAG CTG AAC AGC GAG ATC GCC AAG CTG AAT CTG AAG 932
Ala Lys Arg Gln Glu Leu Asn Ser Glu Ile Ala Lys Leu Asn Leu Lys
1020 1025 1030

ATC GAA GCC TGT AAG AAG AGC ATT GAG AAC GCC AAG CAG GAC CTG CTC 980
Ile Glu Ala Cys Lys Lys Ser Ile Glu Asn Ala Lys Gln Asp Leu Leu
1035 1040 1045

CAG CTC AAG AAT GTC ATC AGC CAG ACC GAG CAT TCC TAC AAG GAG CTC 1028
Gln Leu Lys Asn Val Ile Ser Gln Thr Glu His Ser Tyr Lys Glu Leu
1050 1055 1060

ATG GCC CAG AAC CAG CCC AAG CTG TCC CTG CCC ATC CGA CTG CTC CCA 1076
Met Ala Gln Asn Gln Pro Lys Leu Ser Leu Pro Ile Arg Leu Leu Pro
1065 1070 1075

GAG AAG GAC GAT GCC GGC CTC CCT CCC CCG AAG GCC ACT CGG GGC TGC 1124
Glu Lys Asp Asp Ala Gly Leu Pro Pro Pro Lys Ala Thr Arg Gly Cys
1080 1085 1090 1095

CGG CTA CAC AAC TGC TTT GAT TAT TCT CGT TGC CCT CTC ACC TCT GGC 1172
Arg Leu His Asn Cys Phe Asp Tyr Ser Arg Cys Pro Leu Thr Ser Gly
1100 1105 1110

TTC CCG GTC TAC GTC TAT GAC AGT GAC CAG TTT GTC TTT GGC AGC TAC 1220
Phe Pro Val Tyr Val Tyr Asp Ser Asp Gln Phe Val Phe Gly Ser Tyr
1115 1120 1125

CTG GAT CCC TTG GTC AAG CAG GCT TTT CAG GCG ACA GCA CGA GCT AAC 1268
Leu Asp Pro Leu Val Lys Gln Ala Phe Gln Ala Thr Ala Arg Ala Asn
1130 1135 1140

GTT TAT GTT ACA GAA AAT GCA GAC ATC GCC TGC CTT TAC GTG ATA CTA 1316
Val Tyr Val Thr Glu Asn Ala Asp Ile Ala Cys Leu Tyr Val Ile Leu
1145 1150 1155

GTG GGA GAG ATG CAG GAG CCC GTG GTG CTG CGG CCT GCT GAG CTG GAG 1364
Val Gly Glu Met Gln Glu Pro Val Val Leu Arg Pro Ala Glu Leu Glu
1160 1165 1170 1175

AAG CAG TTG TAT TCC CTG CCA CAC TGG CGG ACG GAT GGA CAC AAC CAT 1412
Lys Gln Leu Tyr Ser Leu Pro His Trp Arg Thr Asp Gly His Asn His
1180 1185 1190

GTC ATC ATC AAT CTG TCA CGT AAG TCA GAT ACA CAG AAC CTT CTC TAT 1460
Val Ile Ile Asn Leu Ser Arg Lys Ser Asp Thr Gln Asn Leu Leu Tyr
1195 1200 1205

AAC GTC AGT ACT GGC CGT GCC ATG GTG GCC CAG TCC ACC TTC TAC ACT 1508
Asn Val Ser Thr Gly Arg Ala Met Val Ala Gln Ser Thr Phe Tyr Thr
1210 1215 1220

GTC CAG TAC AGA CCT GGC TTT GAC TTG GTC GTA TCA CCG CTG GTC CAT 1556
Val Gln Tyr Arg Pro Gly Phe Asp Leu Val Val Ser Pro Leu Val His
1225 1230 1235

GCC ATG TCT GAG CCC AAC TTC ATG GAA ATC CCA CCA CAG GTG CCG GTG 1604
Ala Met Ser Glu Pro Asn Phe Met Glu Ile Pro Pro Gln Val Pro Val
1240 1245 1250 1255

AAG CGG AAA TAT CTC TTC ACC TTC CAG GGC GAG AAG ATT GAG TCT CTG 1652
Lys Arg Lys Tyr Leu Phe Thr Phe Gln Gly Glu Lys Ile Glu Ser Leu
1260 1265 1270

AGG TCT AGC CTT CAG GAG GCC CGC TCC TTC GAA GAG GAA ATG GAG GGC 1700
Arg Ser Ser Leu Gln Glu Ala Arg Ser Phe Glu Glu Glu Met Glu Gly
1275 1280 1285

GAC CCT CCC GCC GAC TAC GAT GAC CGG ATC ATT GCC ACC CTG AAG GCG 1748
Asp Pro Pro Ala Asp Tyr Asp Asp Arg Ile Ile Ala Thr Leu Lys Ala
1290 1295 1300

GTG CAG GAC AGC AAG CTG GAT CAG GTC CTG GTG GAA TTC ACC TGC AAA 1796
Val Gln Asp Ser Lys Leu Asp Gln Val Leu Val Glu Phe Thr Cys Lys
1305 1310 1315

AAC CAG CCC AAA CCC AGC CTG CCG ACT GAG TGG GCA CTG TGT GGA GAG 1844
Asn Gln Pro Lys Pro Ser Leu Pro Thr Glu Trp Ala Leu Cys Gly Glu
1320 1325 1330 1335

CGG GAG GAC CGC TTG GAA TTG CTG AAG CTC TCC ACC TTC GCC CTC ATC 1892
Arg Glu Asp Arg Leu Glu Leu Leu Lys Leu Ser Thr Phe Ala Leu Ile
1340 1345 1350

ATT ACC CCC GGG GAC CCT CGC TTG GTT ATT TCC TCT GGG TGT GCA ACA 1940
Ile Thr Pro Gly Asp Pro Arg Leu Val Ile Ser Ser Gly Cys Ala Thr
1355 1360 1365

CGG CTC TTC GAA GCC CTG GAA GTC GGT GCC GTC CCG GTG GTG CTG GGG 1988
Arg Leu Phe Glu Ala Leu Glu Val Gly Ala Val Pro Val Val Leu Gly
1370 1375 1380

GAG CAG GTC CAG CTT CCC TAC CAG GAC ATG CTG CAG TGG AAC GAG GCG 2036
Glu Gln Val Gln Leu Pro Tyr Gln Asp Met Leu Gln Trp Asn Glu Ala
1385 1390 1395

GCC CTG GTG GTG CCA AAG CCT CGT GTT ACC GAG GTT CAT TTC CTG CTC 2084
Ala Leu Val Val Pro Lys Pro Arg Val Thr Glu Val His Phe Leu Leu
1400 1405 1410 1415

AGA AGC CTC TCC GAT AGT GAC CTC CTG GCT ATG AGG CGG CAA GGC CGC 2132
Arg Ser Leu Ser Asp Ser Asp Leu Leu Ala Met Arg Arg Gln Gly Arg
1420 1425 1430

TTT CTC TGG GAG ACT TAC TTC TCC ACT GCT GAC AGT ATT TTT AAT ACC 2180
Phe Leu Trp Glu Thr Tyr Phe Ser Thr Ala Asp Ser Ile Phe Asn Thr
1435 1440 1445

GTG CTG GCT ATG ATT AGG ACT CGC ATC CAG ATC CCA GCC GCT CCC ATC 2228
Val Leu Ala Met Ile Arg Thr Arg Ile Gln Ile Pro Ala Ala Pro Ile
1450 1455 1460

CGG GAA GAG GCG GCA GCT GAG ATC CCC CAC CGT TCA GGC AAG GCG GCT 2276
Arg Glu Glu Ala Ala Ala Glu Ile Pro His Arg Ser Gly Lys Ala Ala
1465 1470 1475

GGA ACT GAC CCC AAC ATG GCT GAC AAC GGG GAC CTG GAC CTG GGG CCA 2324
Gly Thr Asp Pro Asn Met Ala Asp Asn Gly Asp Leu Asp Leu Gly Pro
1480 1485 1490 1495

GTG GAG ACG GAG CCG CCC TAC GCC TCA CCC AGA TAC CTC CGC AAT TTC 2372
Val Glu Thr Glu Pro Pro Tyr Ala Ser Pro Arg Tyr Leu Arg Asn Phe
1500 1505 1510

ACT CTG ACT GTC ACT GAC TTT TAC CGC AGC TGG AAC TGT GCT CCA GGG 2420
Thr Leu Thr Val Thr Asp Phe Tyr Arg Ser Trp Asn Cys Ala Pro Gly
1515 1520 1525

CCT TTC CAT CTT TTC CCC CAC ACT CCC TTT GAC CCT GTG TTG CCC TCA 2468
Pro Phe His Leu Phe Pro His Thr Pro Phe Asp Pro Val Leu Pro Ser
1530 1535 1540

GAG GCC AAA TTC TTG GGC TCA GGG ACT GGC TTT CGG CCT ATT GGT GGT 2516
Glu Ala Lys Phe Leu Gly Ser Gly Thr Gly Phe Arg Pro Ile Gly Gly
1545 1550 1555

GGA GCT GGG GGT TCT GGC AAG GAA TTT CAG GCA GCG CTT GGA GGC AAT 2564
Gly Ala Gly Gly Ser Gly Lys Glu Phe Gln Ala Ala Leu Gly Gly Asn
1560 1565 1570 1575

GTT CCC CGA GAG CAG TTC ACG GTG GTG ATG TTG ACT TAT GAG CGG GAG 2612
Val Pro Arg Glu Gln Phe Thr Val Val Met Leu Thr Tyr Glu Arg Glu
1580 1585 1590

GAA GTG CTT ATG AAC TCT TTA GAG AGG CTG AAT GGC CTC CCT TAC CTG 2660
Glu Val Leu Met Asn Ser Leu Glu Arg Leu Asn Gly Leu Pro Tyr Leu
1595 1600 1605

AAC AAG GTC GTG GTG GTG TGG AAT TCT CCC AAG CTG CCA TCA GAG GAC 2708
Asn Lys Val Val Val Val Trp Asn Ser Pro Lys Leu Pro Ser Glu Asp
1610 1615 1620

CTT CTG TGG CCT GAC ATT GGC GTT CCC ATC ATG GTG GTC CGT ACT GAG 2756
Leu Leu Trp Pro Asp Ile Gly Val Pro Ile Met Val Val Arg Thr Glu
1625 1630 1635

AAG AAC AGT TTG AAC AAC CGA TTC TTA CCC TGG AAT GAA ATT GAG ACA 2804
Lys Asn Ser Leu Asn Asn Arg Phe Leu Pro Trp Asn Glu Ile Glu Thr
1640 1645 1650 1655

GAG GCC ATC CTG TCC ATT GAT GAC GAT GCT CAC CTC CGC CAT GAC GAA 2852
Glu Ala Ile Leu Ser Ile Asp Asp Asp Ala His Leu Arg His Asp Glu
1660 1665 1670

ATC ATG TTT GGG TTC CGG GTG TGG AGA GAA GCT CGG GAC CGC ATC GTG 2900
Ile Met Phe Gly Phe Arg Val Trp Arg Glu Ala Arg Asp Arg Ile Val
1675 1680 1685

GGC TTC CCT GGC CGT TAC CAC GCA TGG GAC ATC CCC CAT CAG TCC TGG 2948
Gly Phe Pro Gly Arg Tyr His Ala Trp Asp Ile Pro His Gln Ser Trp
1690 1695 1700

CTC TAC AAC TCC AAC TAC TCC TGT GAG CTG TCC ATG GTG CTG ACA GGT 2996
Leu Tyr Asn Ser Asn Tyr Ser Cys Glu Leu Ser Met Val Leu Thr Gly
1705 1710 1715

GCT GCC TTC TTT CAC AAG TAT TAT GCC TAC CTG TAT TCT TAT GTG ATG 3044
Ala Ala Phe Phe His Lys Tyr Tyr Ala Tyr Leu Tyr Ser Tyr Val Met
1720 1725 1730 1735

CCC CAG GCC ATC CGG GAC ATG GTG GAT GAA TAC ATC AAC TGT GAG GAC 3092
Pro Gln Ala Ile Arg Asp Met Val Asp Glu Tyr Ile Asn Cys Glu Asp
1740 1745 1750

ATT GCC ATG AAC TTC CTT GTC TCC CAC ATC ACT CGG AAG CCC CCC ATC 3140
Ile Ala Met Asn Phe Leu Val Ser His Ile Thr Arg Lys Pro Pro Ile
1755 1760 1765

AAG GTG ACC TCA CGG TGG ACA TTC CGA TGC CCA GGA TGC CCT CAG GCC 3188
Lys Val Thr Ser Arg Trp Thr Phe Arg Cys Pro Gly Cys Pro Gln Ala
1770 1775 1780

CTG TCT CAT GAT GAC TCC CAC TTC CAC GAG CGG CAC AAG TGC ATC AAC 3236
Leu Ser His Asp Asp Ser His Phe His Glu Arg His Lys Cys Ile Asn
1785 1790 1795

TTC TTC GTG AAG GTG TAC GGC TAC ATG CCC CTC CTG TAC ACG CAG TTC 3284
Phe Phe Val Lys Val Tyr Gly Tyr Met Pro Leu Leu Tyr Thr Gln Phe
1800 1805 1810 1815

AGG GTG GAT TCT GTG CTC TTC AAG ACA CGC CTG CCC CAT GAC AAG ACC 3332
Arg Val Asp Ser Val Leu Phe Lys Thr Arg Leu Pro His Asp Lys Thr
1820 1825 1830

AAG TGC TTC AAG TTC ATC TAGGGGCAGC GCACGGTCTG GGGAAGAGGA 3380
Lys Cys Phe Lys Phe Ile
1835



TGAGCAGAGG GAGGAAGATG GCTCCCAAGG TTCCTAGGCA TTGCAGGACC TTGGGCACAT 3440

CTGCTGGTGG GTGGCCCAGA GCCTCTGCTG GAAGGGGCAG CAGGAGGAGT GGAAGGAAAC 3500

CGCTGCCTTT ATCTTGAAGT CAGCCACACT GGGCCTGGAG CCCTGGGCGG AGTCCCCGGG 3560

GTTCCCCACA CAGGGCACTG ACTGATAGCT TACACTGAGG ACTGTGGCGA CTCTGCAGAG 3620

TCACTCACAC CGTTCGTACG CCCAGGACAG CTGGTTCGTG GTTTTTACAT TCAATAACAA 3680

CTATTATGAT TATTTAAAAA GAGAAAGTTT CAGATTTGCC ATTCAAGGCT TATTTATATA 3740

TATGTGTGTG TATATAAATA CATGCACACA CTTGCATACA TATATATTTT TGGCTGGGGG 3800

AGTGTGAGTT TTGCCTTTCT AAGGGAGGGA CCGCGCAGGC TCCTTTGTTC TGTATTCTGG 3860

CGGAGATGGG TCCTGGCCTT GTGTCACTGG CTTATCCTTA AAGATCATCT CCCATCCTCC 3920

CCAGCGCCAT CTGTGTGCAG CAACCAGAAA GGGATGAACT TGGCCCTCTT GCGGGCCTGG 3980

ACAAGGTCTC TTCCTTACCC TTTCTGTTGC CAGTCAGCAA CCTGTAACTC ACATTCTCTT 4040

CCCAGTGAAT CCCTGGGAGC GCCTGACCCT GGTGGGCTGT TCAGCTTCCT GCTGCTGGGG 4100

CCAGCGATTT TTGAGGATTT ATCTTTAGGC CAGGCTTGCC TCCGTACTTA TCCCTGCTCT 4160

CCCATTTCTC TCTTGTTTGA GAGAGAATGA GGAAGCAAAG AGTGAGAAAG AATAGGGGCT 4220

GAAGACGCCA CTCCCAGATG GCTCTTTCTA TCCTGCTCTT CTGTTGAAAC ACACGTGCTG 4280

TGGGCCTCAG GCGTTTCTGA AGTGCTCTTT CTTGGATTGG ACAGGAGATC AGCAGCGTGC 4340

ACATCTGCTG CCAGTGTCCT TGGTCTGAAG TGGTTTGCAG GTCAGCCTCC TCTCCCTAGT 4400

GTAGAGCAAG TCGAGGAACC CACCCGGCTG GCCGGGAAGT TTTACAGCAA GGCGCCTGCC 4460

TTGGGATAAT TCCTTGGTGA AATTCACCTT CCCCCCGCCT CTGTCTGGAG CCCCATCCTG 4520

TGTTATCTGT GGTTTTTGGA CCCCTAATGT CAGCTTGGCT GTAGGACTCC CCGAGGTTTG 4580

GTATGTGCTA GAACAATGGG AGGCTGTGAT TTGCTGTGTA AGCTCACATC CAGCCTTGGA 4640

ATCTAACGGG CATTCACAAC CCGAGTTACC ACTTTCCACT CCCTGCTTAG GATTCTGTTC 4700

CCTGGGCTGA AACTGAAATA AGCTAATTTT TTGGGTCACG GTGGCAGTAG GGGAACCTAG 4760

GAGGGTGTGA GTGGCATTTG TCAGGGATTT AGCCCATGAC GTGTTTCTTG AACCCTACTT 4820

TCTGGAAGTG GAGTTGACTC TGGAAGTTTT CTAGCAACTG AACAAAAGCT CAGGTTTGTC 4880

CTGGTCATGC ACATGCCTTA AGCCAGTTCC GTCTTCCCTA GACCTTGGCA TCCTGTGCTT 4940

CTATTTCTTG GAATACGTTC TCCTCTGACC TGCCTGTACC ACGTGGGTCC TCTTCAAGTA 5000

CTGTTTTGAA GCTGGGCTCT TTTGTGTAGC TCCCACCCAC CTGTAGGGCT AGCTCGGCTT 5060

AAGGGAACTC TCCCCATTGG CAAACCGGAC CCGGCCGCCG CCAGGACTGT GTTTCCAAAG 5120

GTTCCCCGCC CCCAACCCCA GCATCAGCCT GTAGCTCCCC TGCTGAGGCA GTGTGGTTAT 5180

GTTCCCAGCA GTGGGGGTCA GACGCCCTTC CTCAGAACTT TCTAGTTGCC CTCTACCTGA 5240

CTCCTGACTT GTATTCCTTT TAGCAGTAGC CTTCTTCCCT CGGGGAGCCA AAGAGTGTGG 5300

TGTGTGGCGC TATATTGTGG CTGCTATTTC ATCTGGTTTC TTTTAATGTG AGGAACTCAC 5360

ATACTGACTT CAGTGGGACT CGGTGAGCCG GGGCCGTCTG TGTGGTGGGA CCCCCTTTAG 5420

CGGGACTCAG TGAGCTGGGG CCGTCTGTGT GGTGGAGCCA GGGCCTCTCC CTTTAGTGGA 5480

GCCAGGTTGT CGGGCCCCGA ATGTCACTGG TGGATCTAAG AAGGGCTGAG TGGTCTGACA 5540

CCAAAACATG CCGCAGGGAG GGCTGTGGTG CCGGTGCTTC CAACAAGGAC AGCCCTCCTT 5600

GACCCTGAAA GGAACACTGG CTTGAAGGAC TGCAGACAGG CTCTGAGGGG CACGCCCTCC 5660

TCAGCGAGAG GCAGCAAGGT GGCCACAGTG TCACTGGTCA GGTGCTTCTC ACCACGGGAA 5720

AGCCGCCGAC CTGTGACTCG CTTGAGATGG GAAAGCGGCG CCACAGACCC CGGGTCTCCT 5780

TGGCTGTCTG TGGGCCGCCC CTGGCCACCT TGTCCTGGCT CGCAGGGTGC AGGAGCGCCT 5840

CGTTCTCTGG GTGGCCGGCT TGCTGCTCCG GTTTGGGCTG TCTTACCATA ACACCGTCCC 5900

AGGGCTCTGC AGGCCACTGT GAGCGCTGGC TCCCTGGGCA GTGCTCCTCC GTGTGGACTG 5960

TGCCTCAGGC CAGGGCTCAC CAGCTGGGGT CCTGTCCGGA AGGATGGGAT CTTTCTGGGA 6020

GCTGCGCCGG ACAGAGTGGG GAGCTCCTAG TTTGTGGGGG GAAGCTTTGA TATCCATGCC 6080

ACGTCCATCC ACCCCACCCC TTTTCGTCAC GAGCACAATG GTCTTACATT GGATTTTTGT 6140

AAAAAAATAA AAATAAATGG AGACTTTAAC TC 6172

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 919 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Thr Gly Tyr Thr Met Leu Arg Asn Gly Gly Ala Gly Asn Gly Gly
1 5 10 15

Gln Thr Cys Met Leu Arg Trp Ser Asn Arg Ile Arg Leu Thr Trp Leu
20 25 30

Ser Phe Thr Leu Phe Val Ile Leu Val Phe Phe Pro Leu Ile Ala His
35 40 45

Tyr Tyr Leu Thr Thr Leu Asp Glu Ala Asp Glu Ala Gly Lys Arg Ile
50 55 60

Phe Gly Pro Arg Val Gly Asn Glu Leu Cys Glu Val Lys His Val Leu
65 70 75 80

Asp Leu Cys Arg Ile Arg Glu Ser Val Ser Glu Glu Leu Leu Gln Leu
85 90 95

Glu Ala Lys Arg Gln Glu Leu Asn Ser Glu Ile Ala Lys Leu Asn Leu
100 105 110

Lys Ile Glu Ala Cys Lys Lys Ser Ile Glu Asn Ala Lys Gln Asp Leu
115 120 125

Leu Gln Leu Lys Asn Val Ile Ser Gln Thr Glu His Ser Tyr Lys Glu
130 135 140

Leu Met Ala Gln Asn Gln Pro Lys Leu Ser Leu Pro Ile Arg Leu Leu
145 150 155 160

Pro Glu Lys Asp Asp Ala Gly Leu Pro Pro Pro Lys Ala Thr Arg Gly
165 170 175



Cys Arg Leu His 
180 

30 

Gly Phe Pro Val 
195 

Tyr Leu Asp Pro 
35 210 

Asn Val Tyr Val 
225 

4 0Leu Val Gly Glu 



Asn Cys Phe Asp 



Tyr Val Tyr Asp 
200 

Leu Val Lys Gin 
215 

Thr Glu Asn Ala 
230 

Met Gin Glu Pro 
245 



Tyr Ser Arg Cys 
185 



Ser Asp Gin Phe 



Ala Phe Gin Ala 
220 

Asp He Ala Cys 
235 

Val Val Leu Arg 
250 



Pro Leu Thr Ser 
190 

Val Phe Gly Ser 
205 

Thr Ala Arg Ala 



Leu Tyr Val He 
240 

Pro Ala Glu Leu 
255 



Glu Lys Gin Leu 
260 

45 

His Val He lie 
275 

Tyr Asn Val Ser 
50 290 

Thr Val Gin Tyr 
305 

55His Ala Met Ser 



Val Lys Arg Lys 
340 

60 

Leu Arg Ser Ser 
355 



Tyr Ser Leu Pro 



Asn Leu Ser Arg 
280 

Thr Gly Arg Ala 
295 

Arg Pro Gly Phe 
310 

Glu Pro Asn Phe 
325 

Tyr Leu Phe Thr 



Leu Gin Glu Ala 
360 



His Trp Arg Thr 
265 

Lys Ser Asp Thr 



Met Val Ala Gin 
300 

Asp Leu Val Val 
315 

Met Glu He Pro 
330 

Phe Gin Gly Glu 
345 

Arg Ser Phe Glu 



Asp Gly His Asn 
270 

Gin Asn Leu Leu 
285 

Ser Thr Phe Tyr 



Ser Pro Leu Val 
320 

Pro Gin Val Pro 
335 

Lys He Glu Ser 
350 

Glu Glu Met Glu 
365 
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Gly Asp Pro Pro Ala Asp Tyr Asp Asp Arg lie lie Ala Thr Leu Lys 
370 375 380 

Ala Val Gln Asp Ser Lys Leu Asp Gln Val Leu Val Glu Phe Thr Cys
385 390 395 400

Lys Asn Gln Pro Lys Pro Ser Leu Pro Thr Glu Trp Ala Leu Cys Gly
405 410 415

Glu Arg Glu Asp Arg Leu Glu Leu Leu Lys Leu Ser Thr Phe Ala Leu
420 425 430

Ile Ile Thr Pro Gly Asp Pro Arg Leu Val Ile Ser Ser Gly Cys Ala
435 440 445

Thr Arg Leu Phe Glu Ala Leu Glu Val Gly Ala Val Pro Val Val Leu
450 455 460

Gly Glu Gln Val Gln Leu Pro Tyr Gln Asp Met Leu Gln Trp Asn Glu
465 470 475 480

Ala Ala Leu Val Val Pro Lys Pro Arg Val Thr Glu Val His Phe Leu
485 490 495

Leu Arg Ser Leu Ser Asp Ser Asp Leu Leu Ala Met Arg Arg Gln Gly
500 505 510

Arg Phe Leu Trp Glu Thr Tyr Phe Ser Thr Ala Asp Ser Ile Phe Asn
515 520 525

Thr Val Leu Ala Met Ile Arg Thr Arg Ile Gln Ile Pro Ala Ala Pro
530 535 540

Ile Arg Glu Glu Ala Ala Ala Glu Ile Pro His Arg Ser Gly Lys Ala
545 550 555 560

Ala Gly Thr Asp Pro Asn Met Ala Asp Asn Gly Asp Leu Asp Leu Gly
565 570 575

Pro Val Glu Thr Glu Pro Pro Tyr Ala Ser Pro Arg Tyr Leu Arg Asn
580 585 590

Phe Thr Leu Thr Val Thr Asp Phe Tyr Arg Ser Trp Asn Cys Ala Pro
595 600 605

Gly Pro Phe His Leu Phe Pro His Thr Pro Phe Asp Pro Val Leu Pro
610 615 620

Ser Glu Ala Lys Phe Leu Gly Ser Gly Thr Gly Phe Arg Pro Ile Gly
625 630 635 640

Gly Gly Ala Gly Gly Ser Gly Lys Glu Phe Gln Ala Ala Leu Gly Gly
645 650 655

Asn Val Pro Arg Glu Gln Phe Thr Val Val Met Leu Thr Tyr Glu Arg
660 665 670

Glu Glu Val Leu Met Asn Ser Leu Glu Arg Leu Asn Gly Leu Pro Tyr
675 680 685

Leu Asn Lys Val Val Val Val Trp Asn Ser Pro Lys Leu Pro Ser Glu
690 695 700

Asp Leu Leu Trp Pro Asp Ile Gly Val Pro Ile Met Val Val Arg Thr
705 710 715 720

Glu Lys Asn Ser Leu Asn Asn Arg Phe Leu Pro Trp Asn Glu Ile Glu
725 730 735

Thr Glu Ala Ile Leu Ser Ile Asp Asp Asp Ala His Leu Arg His Asp
740 745 750

Glu Ile Met Phe Gly Phe Arg Val Trp Arg Glu Ala Arg Asp Arg Ile
755 760 765

Val Gly Phe Pro Gly Arg Tyr His Ala Trp Asp Ile Pro His Gln Ser
770 775 780

Trp Leu Tyr Asn Ser Asn Tyr Ser Cys Glu Leu Ser Met Val Leu Thr
785 790 795 800

Gly Ala Ala Phe Phe His Lys Tyr Tyr Ala Tyr Leu Tyr Ser Tyr Val
805 810 815

Met Pro Gln Ala Ile Arg Asp Met Val Asp Glu Tyr Ile Asn Cys Glu
820 825 830

Asp Ile Ala Met Asn Phe Leu Val Ser His Ile Thr Arg Lys Pro Pro
835 840 845

Ile Lys Val Thr Ser Arg Trp Thr Phe Arg Cys Pro Gly Cys Pro Gln
850 855 860

Ala Leu Ser His Asp Asp Ser His Phe His Glu Arg His Lys Cys Ile
865 870 875 880

Asn Phe Phe Val Lys Val Tyr Gly Tyr Met Pro Leu Leu Tyr Thr Gln
885 890 895

Phe Arg Val Asp Ser Val Leu Phe Lys Thr Arg Leu Pro His Asp Lys
900 905 910

Thr Lys Cys Phe Lys Phe Ile
915

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 125 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Leu Cys Gly Glu Arg Glu Asp Arg Leu Glu Leu Leu Lys Leu Ser Thr
1 5 10 15

Phe Ala Leu Ile Ile Thr Pro Gly Asp Pro Arg Leu Val Ile Ser Ser
20 25 30

Gly Cys Ala Thr Arg Leu Phe Glu Ala Leu Glu Val Gly Ala Val Pro
35 40 45

Val Val Leu Gly Glu Gln Val Gln Leu Pro Tyr Gln Asp Met Leu Gln
50 55 60

Trp Asn Glu Ala Ala Leu Val Val Pro Lys Pro Arg Val Thr Glu Val
65 70 75 80

His Phe Leu Leu Arg Ser Leu Ser Asp Ser Asp Leu Leu Ala Met Arg
85 90 95

Arg Gln Gly Arg Phe Leu Trp Glu Thr Tyr Phe Pro Thr Ala Asp Ser
100 105 110

Ile Phe Asn Thr Val Leu Ala Met Ile Arg Thr Arg Ile
115 120 125



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 120 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Arg Cys His Lys His Gln Val Phe Asp Tyr Pro Gln Val Leu Gln Glu
1 5 10 15

Ala Thr Phe Cys Val Val Leu Arg Gly Ala Arg Leu Gly Gln Ala Val
20 25 30

Leu Ser Asp Val Leu Gln Ala Gly Cys Val Pro Val Val Ile Ala Asp
35 40 45

Ser Tyr Ile Leu Pro Phe Ser Glu Val Leu Asp Trp Lys Arg Ala Ser
50 55 60

Val Val Val Pro Glu Glu Lys Met Ser Asp Val Tyr Ser Ile Leu Gln
65 70 75 80

Ser Ile Pro Gln Arg Gln Ile Glu Glu Met Gln Arg Gln Ala Arg Trp
85 90 95

Phe Trp Glu Ala Tyr Phe Gln Ser Ile Lys Ala Ile Ala Leu Ala Thr
100 105 110

Leu Gln Ile Ile Asn Asp Arg Ile
115 120

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 124 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Arg Cys Asp Arg Asp Asn Thr Glu Tyr Glu Lys Tyr Asp Tyr Arg Glu
1 5 10 15

Met Leu His Asn Ala Thr Phe Cys Leu Val Pro Arg Gly Arg Arg Leu
20 25 30

Gly Ser Phe Arg Phe Leu Glu Ala Leu Gln Ala Ala Cys Val Pro Val
35 40 45

Met Leu Ser Asn Gly Trp Glu Leu Pro Phe Ser Glu Val Ile Asn Trp
50 55 60

Asn Gln Ala Ala Val Ile Gly Asp Glu Arg Leu Leu Leu Gln Ile Pro
65 70 75 80

Ser Thr Ile Arg Ser Ile His Gln Asp Lys Ile Leu Ala Leu Arg Gln
85 90 95

Gln Thr Gln Phe Leu Trp Glu Ala Tyr Phe Ser Ser Val Glu Lys Ile
100 105 110

Val Leu Thr Thr Leu Glu Ile Ile Gln Asp Arg Ile
115 120

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 123 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Arg Cys Glu Gln Asp Pro Gly Pro Gly Gln Thr Gln Arg Gln Glu Thr
1 5 10 15

Leu Pro Asn Ala Thr Phe Cys Leu Ile Ser Gly His Arg Pro Glu Ala
20 25 30

Ala Ser Arg Phe Leu Gln Ala Leu Gln Ala Gly Cys Ile Pro Val Leu
35 40 45

Leu Ser Pro Arg Trp Glu Leu Pro Phe Ser Glu Val Ile Asp Trp Thr
50 55 60

Lys Ala Ala Ile Val Ala Asp Glu Arg Leu Pro Leu Gln Val Leu Ala
65 70 75 80

Ala Leu Gln Glu Met Ser Pro Ala Arg Val Leu Ala Leu Arg Gln Gln
85 90 95

Thr Gln Phe Leu Trp Asp Ala Tyr Phe Ser Ser Val Glu Lys Val Ile
100 105 110

His Thr Thr Leu Glu Val Ile Gln Asp Arg Ile
115 120

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 121 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Lys Cys Ser Gln Glu Asn Cys Ser Leu Glu Arg Arg Arg Gln Leu Ile
1 5 10 15

Gly Ser Ser Thr Phe Cys Phe Leu Leu Pro Ser Glu Met Phe Phe Gln
20 25 30

Asp Phe Leu Ser Ser Leu Gln Leu Gly Cys Ile Pro Ile Leu Leu Ser
35 40 45

Asn Ser Gln Leu Leu Pro Phe Gln Asp Leu Ile Asp Trp Arg Arg Ala
50 55 60

Thr Tyr Arg Leu Pro Leu Ala Arg Leu Pro Glu Ala His Phe Ile Val
65 70 75 80

Gln Ser Phe Glu Ile Ser Asp Ile Ile Glu Met Arg Arg Val Gly Arg
85 90 95

Leu Phe Tyr Glu Thr Tyr Leu Ala Asp Arg His Leu Leu Ala Arg Ser
100 105 110

Leu Leu Ala Ala Leu Arg Tyr Lys Leu
115 120



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 262 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Val Pro Arg Glu Gln Phe Thr Val Val Met Leu Thr Tyr Glu Arg Glu
1 5 10 15

Glu Val Leu Met Asn Ser Leu Glu Arg Leu Asn Gly Leu Pro Tyr Leu
20 25 30

Asn Lys Val Val Val Val Trp Asn Ser Pro Lys Leu Pro Ser Glu Asp
35 40 45

Leu Leu Trp Pro Asp Ile Gly Val Pro Ile Met Val Val Arg Thr Glu
50 55 60

Lys Asn Ser Leu Asn Asn Arg Phe Leu Pro Trp Asn Glu Ile Glu Thr
65 70 75 80

Glu Ala Ile Leu Ser Ile Asp Asp Asp Ala His Leu Arg His Asp Glu
85 90 95

Ile Met Phe Gly Phe Arg Val Trp Arg Glu Ala Arg Asp Arg Ile Val
100 105 110

Gly Phe Pro Gly Arg Tyr His Ala Trp Asp Ile Pro His Gln Ser Trp
115 120 125

Leu Tyr Asn Ser Asn Tyr Ser Cys Glu Leu Ser Met Val Leu Thr Gly
130 135 140

Ala Ala Phe Phe His Lys Tyr Tyr Ala Tyr Leu Tyr Ser Tyr Val Met
145 150 155 160

Pro Gln Ala Ile Arg Asp Met Val Asp Glu Tyr Ile Asn Cys Glu Asp
165 170 175

Ile Ala Met Asn Phe Leu Val Ser His Ile Thr Arg Lys Pro Pro Ile
180 185 190

Lys Val Thr Ser Arg Trp Thr Phe Arg Cys Pro Gly Cys Pro Gln Ala
195 200 205

Leu Ser His Asp Asp Ser His Phe His Glu Arg His Lys Cys Ile Asn
210 215 220

Phe Phe Val Lys Val Tyr Gly Tyr Met Pro Leu Leu Tyr Thr Gln Phe
225 230 235 240

Arg Val Asp Ser Val Leu Phe Lys Thr Arg Leu Pro His Asp Lys Thr
245 250 255

Lys Cys Phe Lys Phe Ile
260

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 269 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Pro Gln Ser Gln Gly Phe Thr Gln Ile Val Leu Thr Tyr Asp Arg Val
1 5 10 15

Glu Ser Leu Phe Arg Val Ile Thr Glu Val Ser Lys Val Pro Ser Leu
20 25 30

Ser Lys Leu Leu Val Val Trp Asn Asn Gln Asn Lys Asn Pro Pro Glu
35 40 45

Asp Ser Leu Trp Pro Lys Ile Arg Val Pro Leu Lys Val Val Arg Thr
50 55 60

Ala Glu Asn Lys Leu Ser Asn Arg Phe Phe Pro Tyr Asp Glu Ile Glu
65 70 75 80

Thr Glu Ala Val Leu Ala Ile Asp Asp Asp Ile Ile Met Leu Thr Ser
85 90 95

Asp Glu Leu Gln Phe Gly Tyr Glu Val Trp Arg Glu Phe Pro Asp Arg
100 105 110

Leu Val Gly Tyr Pro Gly Arg Leu His Leu Trp Asp His Glu Ala Met
115 120 125

Asn Lys Trp Lys Tyr Glu Ser Glu Trp Thr Asn Glu Val Ser Met Val
130 135 140

Leu Thr Gly Ala Ala Phe Tyr His Lys Tyr Phe Asn Tyr Leu Tyr Thr
145 150 155 160

Lys Met Pro Gly Asp Ile Lys Asn Trp Val Asp Ala His Met Asn Cys
165 170 175

Tyr Glu Asp Ile Ala Met Asn Phe Leu Val Ala Asn Val Thr Gly Lys
180 185 190

Ala Val Ile Lys Val Thr Pro Arg Lys Lys Phe Lys Cys Pro Glu Cys
195 200 205

Thr Ala Ile Asp Gly Leu Ser Leu Asp Gln Thr His Met Val Glu Arg
210 215 220

Ser Glu Cys Ile Asn Lys Phe Ala Ser Val Phe Gly Thr Met Pro Leu
225 230 235 240

Lys Val Val Glu His Arg Ala Asp Pro Val Leu Tyr Lys Asp Asp Phe
245 250 255

Pro Glu Lys Leu Lys Ser Phe Pro Asn Ile Gly Ser Leu
260 265

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 270 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Pro Pro Ser Lys Phe Thr Ala Val Ile His Ala Val Thr Pro Leu Val
1 5 10 15

Ser Gln Ser Gln Pro Val Leu Lys Leu Leu Val Ala Ala Ala Lys Ser
20 25 30

Gln Tyr Cys Ala Gln Ile Ile Val Leu Trp Asn Cys Asp Lys Pro Leu
35 40 45

Pro Ala Lys His Arg Trp Pro Ala Thr Ala Val Pro Val Val Val Ile
50 55 60

Glu Gly Glu Ser Lys Val Met Ser Ser Arg Phe Leu Pro Tyr Asp Asn
65 70 75 80

Ile Ile Thr Asp Ala Val Leu Ser Leu Asp Glu Asp Thr Val Leu Ser
85 90 95

Thr Thr Glu Val Asp Phe Ala Phe Thr Val Trp Gln Ser Phe Pro Glu
100 105 110

Arg Ile Val Gly Tyr Pro Ala Arg Ser His Phe Trp Asp Asn Ser Lys
115 120 125

Glu Arg Trp Gly Tyr Thr Ser Lys Trp Thr Asn Asp Tyr Ser Met Val
130 135 140

Leu Thr Gly Ala Ala Ile Tyr His Lys Tyr Tyr His Tyr Leu Tyr Ser
145 150 155 160

His Tyr Leu Pro Ala Ser Leu Lys Asn Met Val Asp Gln Leu Ala Asn
165 170 175

Cys Glu Asp Ile Leu Met Asn Phe Leu Val Ser Ala Val Thr Lys Leu
180 185 190

Pro Pro Ile Lys Val Thr Gln Lys Lys Gln Tyr Lys Glu Thr Met Met
195 200 205

Gly Gln Thr Ser Arg Ala Ser Arg Trp Ala Asp Pro Asp His Phe Ala
210 215 220

Gln Arg Gln Ser Cys Met Asn Thr Phe Ala Ser Trp Phe Gly Tyr Met
225 230 235 240

Pro Leu Ile His Ser Gln Met Arg Leu Asp Pro Val Leu Lys Asp Gln
245 250 255

Val Ser Ile Leu Arg Lys Lys Tyr Arg Asp Ile Glu Arg Leu
260 265 270

(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Pro Glu Gly Arg Phe Ser Ala Leu Ile Trp Val Gly Pro Pro Gly Gln
1 5 10 15

Pro Pro Leu Lys Leu Ile Gln Ala Val Ala Gly Ser Gln His Cys Ala
20 25 30

Gln Ile Leu Val Leu Trp Ser Asn Glu Arg Pro Leu Pro Ser Arg Trp
35 40 45

Pro Glu Thr Ala Val Pro Leu Thr Val Ile Asp Gly His Arg Lys Val
50 55 60

Ser Asp Arg Phe Tyr Pro Tyr Ser Thr Ile Arg Thr Asp Ala Ile Leu
65 70 75 80

Ser Leu Asp Ala Arg Ser Ser Leu Ser Thr Ser Glu Val Asp Phe Ala
85 90 95

Phe Leu Val Trp Gln Ser Phe Pro Glu Arg Met Val Gly Phe Leu Thr
100 105 110

Ser Ser His Phe Trp Asp Glu Ala His Gly Gly Trp Gly Tyr Thr Ala
115 120 125

Glu Arg Thr Asn Glu Phe Ser Met Val Leu Thr Thr Ala Ala Phe Tyr
130 135 140

His Arg Tyr Tyr His Thr Leu Phe Thr His Ser Leu Pro Lys Ala Leu
145 150 155 160

Arg Thr Leu Ala Asp Glu Ala Pro Thr Cys Val Asp Val Leu Met Asn
165 170 175

Phe Ile Val Ala Ala Val Thr Lys Leu Pro Pro Ile Lys Val Pro Tyr
180 185 190

Gly Lys Gln Arg Gln Glu Ala Ala Pro Leu Ala Pro Gly Gly Pro Gly
195 200 205

Pro Arg Pro Lys Pro Pro Ala Pro Ala Pro Asp Cys Ile Asn Gln Ile
210 215 220

Ala Ala Ala Phe Gly His Met Pro Leu Leu Ser Ser Arg Leu Arg Leu
225 230 235 240

Asp Pro Val Leu Phe Lys Asp Pro Val Ser Val Gln Arg Lys Lys Tyr
245 250 255

Arg Ser Leu Glu Lys Pro
260



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 270 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Ser Thr Met Asp Ser Phe Thr Leu Ile Met Gln Thr Tyr Asn Arg Thr
1 5 10 15

Asp Leu Leu Leu Lys Leu Leu Asn His Tyr Gln Ala Val Pro Asn Leu
20 25 30

His Lys Val Ile Val Val Trp Asn Asn Ile Gly Glu Lys Ala Pro Asp
35 40 45

Glu Leu Trp Asn Ser Leu Gly Pro His Pro Ile Pro Val Ile Phe Lys
50 55 60

Gln Gln Thr Ala Asn Arg Met Arg Asn Arg Leu Gln Val Phe Pro Glu
65 70 75 80

Leu Glu Thr Asn Ala Val Leu Met Val Asp Asp Asp Thr Leu Ile Ser
85 90 95

Thr Pro Asp Leu Val Phe Ala Phe Ser Val Trp Gln Gln Phe Pro Asp
100 105 110

Gln Ile Val Gly Phe Val Pro Arg Lys His Val Ser Thr Ser Ser Gly
115 120 125

Ile Tyr Ser Tyr Gly Ser Phe Glu Met Gln Ala Pro Gly Ser Gly Asn
130 135 140

Gly Asp Gln Tyr Ser Met Val Leu Ile Gly Ala Ser Phe Phe Asn Ser
145 150 155 160

Lys Tyr Leu Glu Leu Phe Gln Arg Gln Pro Ala Ala Val His Ala Leu
165 170 175

Ile Asp Asp Thr Gln Asn Cys Asp Asp Ile Ala Met Asn Phe Ile Ile
180 185 190

Ala Lys His Ile Gly Lys Thr Ser Gly Ile Phe Val Lys Pro Val Asn
195 200 205

Met Asp Asn Leu Glu Lys Glu Thr Asn Ser Gly Tyr Ser Gly Met Trp
210 215 220

His Arg Ala Glu His Ala Leu Gln Arg Ser Tyr Cys Ile Asn Lys Leu
225 230 235 240

Val Asn Ile Tyr Asp Ser Met Pro Leu Arg Tyr Ser Asn Ile Met Ile
245 250 255

Ser Gln Phe Gly Phe Pro Tyr Ala Asn Tyr Lys Arg Lys Ile
260 265 270



(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 259 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Arg Gln Arg Glu Gln Phe Thr Val Val Leu Leu Thr Tyr Glu Arg Asp
1 5 10 15

Ala Val Leu Thr Gly Ala Leu Glu Arg Leu His Gln Leu Pro Tyr Leu
20 25 30

Asn Lys Ile Ile Val Val Trp Asn Asn Val Asn Arg Asp Pro Pro Asp
35 40 45

Ser Trp Pro Ser Leu His Ile Pro Val Glu Phe Ile Arg Val Ala Glu
50 55 60

Asn Asn Leu Asn Asn Arg Phe Val Pro Trp Asp Arg Ile Glu Thr Glu
65 70 75 80

Ala Val Leu Ser Leu Asp Asp Asp Ile Asp Leu Met Gln Gln Glu Ile
85 90 95

Ile Leu Ala Phe Arg Val Trp Arg Glu Asn Arg Asp Arg Ile Val Gly
100 105 110

Phe Pro Ala Arg His His Ala Arg Tyr Gly Asp Ser Met Phe Tyr Asn
115 120 125

Ser Asn His Thr Cys Gln Met Ser Met Ile Leu Thr Gly Ala Ala Phe
130 135 140

Ile His Lys Asn Tyr Leu Thr Ala Tyr Thr Tyr Glu Met Pro Ala Glu
145 150 155 160

Ile Arg Glu His Val Asn Ser Ile Lys Asn Cys Glu Asp Ile Ala Met
165 170 175

Asn Tyr Leu Val Ser His Leu Thr Arg Lys Pro Pro Ile Lys Thr Thr
180 185 190

Ser Arg Trp Thr Leu Lys Cys Pro Thr Cys Thr Glu Ser Leu Tyr Lys
195 200 205

Glu Gly Thr His Phe Glu Lys Arg His Glu Cys Met Arg Leu Phe Thr
210 215 220

Lys Ile Tyr Gly Tyr Asn Pro Leu Lys Phe Ser Gln Phe Arg Ala Asp
225 230 235 240

Ser Ile Leu Phe Lys Thr Arg Leu Pro Gln Asn His Gln Lys Cys Phe
245 250 255

Lys Tyr Val

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

TTATGGCGAG TGACCCGACG TG

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

TTGCTAAAGT GAAGGAAGTT GG

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

ACCCGACGTG ATCTGG

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

AAGAGCTCCT GCAGCTGG

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

TTCTCGTTGC CCTCTCAC

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

ATCATCAATC TGTCACG

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

ACTACGATGA CCGGATC

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

TTCCCTACCA GGACATGC

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

AACATGGCTG ACAACG

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

TATTGGTGGT GGAGCTGG

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

AATCCAGCCA TGGTCTCCTT GG

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

AGTCGATGCC ATTATTACCA GC

(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

TTCCTTCCTC ATCACAG

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

AGGTCTGTGT ATGCACTTGT G

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

AGTCGATGCC ATTATTACCA GC

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

TTCAAGGGTG TGGAGAG

(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

TTGGCTGAAA GCCAACAACC TG

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

AACATGCACG CATCCACAGC

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

TTGTAACACA GCATGTGG

(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

GGTTCTGTCA GTATTAGCTG GG

(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

TTCCTCCCTC TGCTCATCCT C

(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

TTCCCACTCT GTCTCTC



